HTML Metadata Extractor (OG, Twitter Cards, Schema.org)
Pricing
Pay per usage
HTML Metadata Extractor (OG, Twitter Cards, Schema.org)
Extract structured metadata from any URL: title, description, OpenGraph (og:title/image/type/url/site_name), Twitter Cards, canonical, favicon, JSON-LD schema.org, language, h1 count, images, links. Used for link previews, SEO audits, content cataloging. $0.005/URL.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Hojun Lee
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
HTML Metadata Extractor
Extract structured metadata from any URL: title, description, OpenGraph (og:*), Twitter Cards, canonical, favicon, JSON-LD schema.org, language, h1 count, images, links. Used for link previews, SEO audits, content cataloging. $0.005 per URL.
Why this exists
Whenever you build a link-preview feature (Slack-style unfurl, Discord embed, content sharing), you need to call the same dozen meta-tag lookups. This actor does it in one call, returning everything you'd ever want from <head>.
What you get per row
Title / description / language
| Field | Example |
|---|---|
title | Apify: Full-stack web scraping... |
meta_description | Cloud platform for web scraping... |
meta_keywords | web scraping, automation, ... |
html_lang | en |
domain | www.apify.com |
canonical_url | https://www.apify.com/ |
OpenGraph
| Field | Example |
|---|---|
og_title | Apify |
og_description | ... |
og_image | https://...og.png |
og_type | website |
og_url | https://... |
og_site_name | Apify |
og_locale | en_US |
Twitter Card
| Field | Example |
|---|---|
twitter_card | summary_large_image |
twitter_title | ... |
twitter_description | ... |
twitter_image | https://... |
twitter_site | @apify |
twitter_creator | @apify |
Article-specific
| Field | Example |
|---|---|
article_published_time | 2026-06-09T14:00:00Z |
article_modified_time | 2026-06-10T10:00:00Z |
article_author | Jane Doe |
article_section | Engineering |
article_tags | ["python","scraping"] |
JSON-LD schema.org
Full array of structured-data objects found on the page (Article, Product, FAQPage, etc).
Structural stats
| Field | Example |
|---|---|
h1_count | 1 |
first_h1 | Welcome to Apify |
image_count | 34 |
link_count | 127 |
Quick start
Single URL
{"url": "https://www.apify.com/"}
Batch
{"urls": ["https://www.apify.com/","https://docs.apify.com/","https://www.techcrunch.com/"]}
Pricing
Pay-Per-Event: $0.005 per URL.
| Run | URLs | Cost |
|---|---|---|
| Single | 1 | $0.005 |
| Batch of 100 | 100 | $0.50 |
| Daily 1K SEO audit | 1000 | $5.00 |
Use cases
- Slack/Discord link unfurl — Power your own bot's URL preview
- SEO competitor audit — Pull OG / Twitter / canonical from 100 sites at once
- Content catalog — Build a database of article metadata from your RSS sources
- Schema.org validation — Verify structured data is present on key pages
- Newsletter aggregation — Get clean previews for each linked article
Related actors (same author)
- Web Page → Markdown Converter — Get the article body too
- Sitemap URL Discovery — Find all URLs to extract metadata from
- PDF Text Extractor
- JSON Schema Generator
Feedback
A short review helps SEO / content engineers find it: Leave a review on Apify Store